# wav2vec2 Fine-tuning
Deepfake Audio Detection V1
Apache-2.0
A deepfake audio detection model fine-tuned based on wav2vec2-base, achieving 99.66% accuracy
Audio Classification
Transformers

D
Zeyadd-Mostaffa
33
0
Wav2 Noise
Apache-2.0
A noise recognition model fine-tuned from facebook/wav2vec2-base with 93.89% accuracy
Audio Classification
Transformers

W
zylin12
1
0
My Awesome Mind Model
Apache-2.0
An audio classification model fine-tuned on the minds14 dataset based on facebook/wav2vec2-base
Audio Classification
Transformers

M
faaany
1
0
Wav2vec2 Large Lv60 Phoneme Timit English Timit 4k 002
Apache-2.0
A fine-tuned English phoneme recognition model based on facebook/wav2vec2-large-lv60 on the TIMIT dataset, achieving a phoneme error rate of 10.53%
Speech Recognition
Transformers English

W
excalibur12
103
1
Speechbrain Emotion Recognition Openvino
Apache-2.0
This model uses a fine-tuned wav2vec2 (base) architecture, trained on the IEMOCAP dataset for speech emotion recognition tasks.
Audio Classification English
S
psakamoori
13
0
Wav2vec Base Crema Sentiment Analysis
Apache-2.0
A speech emotion analysis model fine-tuned based on facebook/wav2vec2-base, achieving 70.87% accuracy on the evaluation set
Audio Classification
Transformers

W
Piyush2512
38
0
Wav2vec2 Base Arabic Speech Emotion Recognition
Apache-2.0
A fine-tuned Arabic speech emotion recognition model based on facebook/wav2vec2-base, achieving 99.92% accuracy on the evaluation dataset.
Audio Classification
Transformers

W
ahmmedasaad2772
352
0
Wav2vec2 Large Xlsr 53 English Finetuned Ravdess
Apache-2.0
A speech emotion recognition model fine-tuned on the RAVDESS dataset based on the wav2vec2-large-xlsr-53-english model
Audio Classification
Transformers

W
firdho26
68
0
My Awesome Mind Model
Apache-2.0
An audio classification model fine-tuned based on facebook/wav2vec2-base, achieving 58.92% accuracy on the evaluation set
Audio Classification
Transformers

M
Krithika-p
15
0
Wav2vec2 Audio Emotion Classification
Apache-2.0
A fine-tuned audio emotion classification model based on facebook/wav2vec2-base, achieving 73.98% accuracy on the evaluation set
Audio Classification
Transformers

W
chin-may
77
5
Wav2vec2 Base Music Speech Both Classification Finetuned Gtzan
Apache-2.0
Audio classification model based on wav2vec2 architecture, fine-tuned on the GTZAN dataset for music and speech classification tasks
Audio Classification
Transformers

W
0bi0n3
15
1
Wav2vec2 Base Finetuned Gtzan
Apache-2.0
This model is an audio classification model fine-tuned on the GTZAN dataset based on facebook/wav2vec2-base, primarily used for music genre classification tasks.
Audio Classification
Transformers

W
wilson-wei
14
0
Finetuned Wav2vec2.0 Base On IEMOCAP 2
Apache-2.0
This is a speech emotion recognition model based on the facebook/wav2vec2-base model fine-tuned on the IEMOCAP dataset, achieving 73.9% accuracy on the evaluation set.
Audio Classification
Transformers

F
minoosh
32
2
CREMA D Model
Apache-2.0
A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving 73.22% accuracy on the evaluation set
Audio Classification
Transformers

C
jdmartinev
21
0
Bird Classification Model
Apache-2.0
An audio classification model fine-tuned based on facebook/wav2vec2-base for identifying bird sounds
Audio Classification
Transformers

B
Saads
19
1
Iewav2vec2 Finetuned On Shemo
Apache-2.0
This model is a fine-tuned version of minoosh/wav2vec2-base-finetuned-ie on the shEMO dataset, primarily used for speech emotion recognition tasks.
Audio Classification
Transformers

I
minoosh
20
0
Wav2vec2 Base Speech Emotion Recognition
Apache-2.0
A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, used to predict the speaker's emotions in audio samples.
Audio Classification
Transformers English

W
DunnBC22
128
13
Audio Class Finetuned
Apache-2.0
This model is a fine-tuned audio classification model based on facebook/wav2vec2-base on the superb dataset, achieving an accuracy of 0.6578 on the evaluation set.
Audio Classification
Transformers

A
Chemsseddine
20
0
Wav2vec2 Base Finetuned Ks
Apache-2.0
A speech recognition model fine-tuned on the superb dataset based on facebook/wav2vec2-base, achieving 98.34% accuracy
Speech Recognition
Transformers

W
marcatanante1
13
0
Ser Model Fixed Label
Apache-2.0
A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving an accuracy of 83.67% on the evaluation set
Audio Classification
Transformers

S
aherzberg
18
1
Englishspeechtotext
Apache-2.0
Fine-tuned English speech recognition model based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers

E
Foxasdf
24
1
Wav2vec2 Base Finetuned Ks
Apache-2.0
This model is a speech recognition model fine-tuned on the SUPERB dataset based on facebook/wav2vec2-base, demonstrating excellent performance in keyword spotting tasks.
Speech Recognition
Transformers

W
teoha
14
0
Wav2vec2 Large Emotion Detection German
Apache-2.0
A German speech emotion detection model based on wav2vec2, trained on the emo-DB dataset, capable of recognizing 7 different emotions.
Audio Classification
Transformers German

W
padmalcom
20
3
Wav2vec2 Base Finetuned Ks
Apache-2.0
A speech recognition model fine-tuned on the superb dataset based on wav2vec2-base, achieving 98.15% accuracy
Audio Classification
Transformers

W
ngeg2015
14
0
Wav2vec2 Base Intent Classification Ori F1
Apache-2.0
This model is a speech intent classification model fine-tuned from facebook/wav2vec2-base, achieving an F1 score of 0.875 on the evaluation set.
Audio Classification
Transformers

W
MuhammadIqbalBazmi
14
0
Wav2vec2 Large 960h Intent Classification Ori
Apache-2.0
Fine-tuned intent classification model based on facebook/wav2vec2-large-960h, achieving 77.08% accuracy on the evaluation set
Audio Classification
Transformers

W
MuhammadIqbalBazmi
15
0
Wav2vec2 Base Intent Classification Ori
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-base on the intent-dataset for intent classification tasks.
Audio Classification
Transformers

W
MuhammadIqbalBazmi
18
1
My Awesome Minds Model
Apache-2.0
A speech recognition model fine-tuned on the minds14 dataset based on facebook/wav2vec2-base
Speech Recognition
Transformers

M
stevhliu
107
0
Urdu Audio Emotions
Apache-2.0
A fine-tuned Urdu audio emotion classification model based on facebook/wav2vec2-large-xlsr-53, supporting recognition of four emotions: anger, happiness, calmness, and sadness.
Audio Classification
Transformers

U
Talha
66
15
Wav2vec2 Base Timit Demo Colab
Apache-2.0
A speech recognition model fine-tuned on the TIMIT dataset based on the facebook/wav2vec2-base model, featuring a low Word Error Rate (WER).
Speech Recognition
Transformers

W
nawta
96
1
Wav2vec2 Base Timit Demo Google Colab
Apache-2.0
A speech recognition model fine-tuned on the TIMIT dataset based on facebook/wav2vec2-base, specializing in English speech-to-text tasks
Speech Recognition
Transformers

W
dasolj
127
0
Wac2vec Lllfantomlll
Apache-2.0
A speech recognition model fine-tuned based on facebook/wav2vec2-base, achieving a word error rate of 0.3417 on the evaluation set.
Speech Recognition
Transformers

W
lllFaNToMlll
27
0
Wav2vec2 Base Timit Demo Colab53
Apache-2.0
A speech recognition model fine-tuned based on facebook/wav2vec2-base, suitable for the TIMIT dataset
Speech Recognition
Transformers

W
Mudassar
22
0
Wav2vec2 Final 1 Lm 2
Apache-2.0
A fine-tuned speech recognition model based on facebook/wav2vec2-base, with a Word Error Rate (WER) of 0.283, and 0.126 when using 3-gram
Speech Recognition
Transformers

W
chrisvinsen
15
0
One Simple Finetune Test
Apache-2.0
This model is a fine-tuned version of RuiqianLi/wav2vec2-large-xls-r-300m-singlish-colab based on the li_singlish dataset, primarily used for Singapore English speech recognition tasks.
Speech Recognition
Transformers

O
RuiqianLi
28
0
Wav2vec2 Base Timit Demo Google Colab
Apache-2.0
This model is a speech recognition model fine-tuned on the TIMIT dataset based on facebook/wav2vec2-base, focusing on English speech-to-text tasks.
Speech Recognition
Transformers

W
wrice
17
0
Filipino Wav2vec2 L Xls R 300m Official
Apache-2.0
A speech recognition model fine-tuned on Filipino speech datasets based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers

F
Khalsuu
1.2M
1
Wav2vec2 Base Timit Demo Colab53
Apache-2.0
A speech recognition model fine-tuned on the TIMIT dataset based on the facebook/wav2vec2-base model, primarily used for English speech-to-text tasks.
Speech Recognition
Transformers

W
hassnain
16
0
Wav2vec2 Base Timit Demo Colab92
Apache-2.0
A speech recognition model fine-tuned on the TIMIT dataset based on the facebook/wav2vec2-base model
Speech Recognition
Transformers

W
hassnain
16
0
Wav2vec2 Base Timit Demo Colab50
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base, trained for 30 epochs on the TIMIT dataset.
Speech Recognition
Transformers

W
hassnain
16
0
- 1
- 2
Featured Recommended AI Models